WE CLAIM: 



1, ! A protein derived from an enterically 
transmitted non-A/non-B viral hepatitis agent whose 
5 genome contains a region which is homologous to a 

coding region of the 1.33 kb DNA EcoRI insert present 
in plasmid bTZKFl ( ET1 . 1 ) carried in coli strain BB4 
and having AtcC deposit no. 67717. 

10 2. The\ protein of claim 1, which is encoded by 

a complete coding region within said 1.33 kb EcoRI 
insert . 
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3 . A recombinant protein derived from an 
enterically transmitted nonA/nonB viral hepatitis 
agent whose genome! contains a region which is 
homologous to a cod&ng region of a DNA molecule having 
a first sequence (SHQ ID NO . 1 ) : 

AGACCTGTCC CTGTTGCAGCJ^TTCTACQA CCCTGCCCCG AGCTCGAACA GGGCCTTCTC 60 

tacctgcccc aggagct/ac cacVtgtgat A&IGTCGTAA CATTTGAATT AACAGACATT 120 

GTGCACTGCC GCATCttCCGC CCCGAfcCCAG C3CMGGCCG TGCTGTCCAC ACTCGTGGGC 180 

CGCTACGGCG GTckfACAAA GCTCTA0AAT GCTTCcVcT^TGATGTTCG CGACTCTCTC 240 

Igccat tggccccw^ggttaAa CTTGTGAATT GTACGAGCTA 300 

^GGGCCAGGAT\ GGCTCCGCCa TCCTTGAGCT TGATCTTTGC 360 

GAT CACCTTCTTC QAGAAAGATT 1GTAACAAGTT CACCACAGGT 420 

GAGACCATTG CCCATcbmA AGTGGGCCAG GgYaTCTCGG |CTGGAGCAA GACCTTCTGC 480 

GCCCTCTTTG GCCCTTGGf T CCGCGCTATT GAG/UgGCTA ^TCTGGCCCT GCTCCCTCAG 540 

GGTGTGTTTT AGGGTGATGC CTTTGATGAC ACCGTCTTCT CGGCGGCTGT GGCCGCAGCA 600 

AAGGCATCCA TGGTGTTTGA GAATGACTTT TCTGAGWTG ACTCCACCCA GAATAACTTT 660 

TCTCTGGGTC TAGAGTGTGC TATTATGGAG GAGTGTGGfiA TGCCGCAGTG GCTCATCCGC 720 

CTGTATCACC TTATAAGGTC TGCGTGGATC TTGCAGGCCGL CGAAGGAGTC TCTGCGAGGG 780 

TTTTGGAAGA AACACTCCGG TGAGCCCGGC ACTCTTCTAT \gaATACTGT CTGGAATATG 840 

GCCGTTATTA CCCACTGTTA TGACTTCCGC GATTTTCAGG TGGCTGCCTT TAAAGGTGAT 900 



GCCCGTTTTA TCCC 
GTGGAGGCCA TGGTC 
AACCGTGACG TGTCd 
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GATTCGATAG TGyTTGCAG TGAGTATCGT CAGAGTCCAG GAGCTGCTGT CCTGATCGCC 

GGCTGTGGCT TGA^GTTGAA GGTAGATTTC CGCCCGATCG GTTTGTATGC AGGTGTTGTG 

gtggcccccg gcctAgcgc GCTCCCTGAT GTTGTGCGCT TCGCCGGCCG GCTTACCGAG 

AAGAATTGGG GCCCTGGfeCC TGAGCGGGCG GAGCAGCTCC GCCTCGCTGT TAGTGATTTC 

CTCCGCAAGC TCACGAATGT AGCTCAGATG TGTGTGGATG TTGTTTCCCG TGTTTATGGG 

GTTTCCCCTG GACTCGTTcA TAACCTGATT GGCATGCTAC AGGCTGTTGC TGATGGCAAG 

GCACATTTCA CTGAGTCAGT WaCCAGTG CTCGA 

a second sequence (SEiQ ID NO. 5): 

TCGAGCACTG GTTTTACTGA CTCAGTGAAA TGTGCCTTGC CATCAGCAAC AGCCTGTAGC 

ATGCCAATCA GGTTATGAAC GAGTCCAGGG GAAACCCCAT AAACACGGGA AACAACATCC 

ACACACATCT GAGCTACATT CGTGAGCTTG CGGAGGAAAT CACTAACAGC GAGGCGGAGC 

TGCTCCGCCC GCTCAGGGCC AGGGCSCCAA TTCTTCTCGG TAAGCCGGCC GGCGAAGCGC 

ACAACATCAG GGAGCGCGCC AAGGpOsGGG GCCAtfSACAA CACCTGCATA CAAACCGATC 

GGGCGGAAAT CTACCTTCAA CWCAAgIcA CAGCCGGDSA TCAGGACAGC AGCTCCTGGA 



CTCTGACGAT ACTCACTGCA 

AAATCGCGGA AGTCATAACA jaTGGGTAAT^ ACGGCCA 
f 

AGAGTGCCGG GCTCACCGdjA 



TGCAAGATCC ACGCAGACCfT 



AGCACTAf C GAATCATCAC\CTTTAAA13GC AGCCACCTGA 
CAGACAGT ATTCCATAGA 
STGTTTCTlU^AAACCCTC GCAGAGACTC CTTCGGGGCC 
AGGTGA VaCAGGCGGA TGA&CCACTG CGGCATCCCA 



CACTCCTCCA TAATAGCACf STCTAGACCC AGJAGAAAAGT TATTGJGGGT GGAGTCAAAC 
TCAGAAAAGT CATTCTCAA^ C^CCATGGAT GC^TTTGCTG CGGCC^AGC CGCCGAGAAG 
ACGGTGTCAT CAAAGGCATcUt$GTAAAAC ACA^CCTGAG GGAGCAGGGC CAGAATAGCC 
TTCTCAATAG CGCGGAACCA /\^G(^CAAAG AGGG^CAGA AGGTCTTGCT CCAGGCCGAG 
ATGCCCTGGC CCACTTTACC ATGGGCAATG GTCTC/CCTG TGGTGAACTT GTTACAATCT 
TTCTGGAAGA AGGTGATCCT GGACACGTCA CGGTTGCMA GATCAAGCTC AAGGACGGCG 
GAGCCATCCT GGCCCTTCTC GACCATGGCC TCCACTAGCT CGTACAATTC ACAAGTTGTA 
ACCTGTACGG GGCCAATGGC CGGGATAAAA CGGGCGAGAG AGTCGCGAAC ATCAGAGTGG 
GAAGCATTGT AGAGCTTTGT GCGACCGCCG TAGCGGCCCA CGAGTGTGGA CAGCACGGCC 
TTGCGCTGGC TCGGGGCGGC CATGCGGCAG TGCACAATGT CTGTTAATTC AAATGTTACG 



960 
1020 
1080 
1140 
1200 
1260 
1295 

60 
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240 
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ACACTATOC AGGTGGTGAG CTCCTGGGGC AGGTAGAGAA GGCCCTGTTC GAGCTCGGGG 

CAGGGTGGYA GAACAGCTGC AACAGGGACA GGTCT 

a thirql sequence (SEQ ID NO . 6 ) : 
aggcagaccaVatatgtggt CGATGCC ATGGAGGCCC ATCAGTTTAT TAAGGCTCCT 

ggcatcacA cVgctattga GCAGGCTGCT CTAGCAGCGG CCAACTCTGC CCTGGCGAAT 

GCTGTGGTAG VtAGGCCTTT TCTCTCTCAC CAGCAGATTG AGATCCTCAT TAACCTAATG 

caacctcgcc a\c!tgtttt CCGCCCCGAG GTTTTCTGGA ATCATCCCAT CCAGCGTGTC 

ATCCATAACG AGCTGGAGCT TTACTGCCGC GCCCGCTCCG GCCGCTGTCT TGAAATTGGC 

GCCCATCCCC GCTCAATAAA TGATAATCCT aatgtggtcc accgctgctt cctccgccct 

GTTGGGCGTG ATGTTGAM CTGGT^TACT GCTCCCACTC GCGGGCCGGC TGCTAATTGC 

fx \ \ 

CGGCGTTCCG CGCTJSCGpSi GCTTCCCGQT GCTGACCGCA CTTACTGCCT CGACGGGTTT 
TCTGGCTGTA A£ jTTCCOGC \cGAGACTGGc\aTCGCCCTCT ACTCCCTTCA TGATATGTCA 



CCATCTGATG T| 
CATCTTCCGC C" 



iCCGAGGa CwTGTTCCGC QATGGTATGA CGCGGCTCTA TGCCGCCCTC 

aAGGTccT \c\gccccct ggVacatatc GCACCGCATC GTATTTGCTA 



ATTCATGACG GT|GGCGCGT U^TGACG TATfcAGGGTG ATACTAGTGC TGGTTACAAC 
CACGATGTCT CC4WTTTGCG CTC&TGGATT AGAAECACCA AGGTTACCGG AGACCATCCC 
CTCGTTATCG AGCppGTTAG GGCC^TTGGC TGCCACTTTG TTCTCTTGCT CACGGCAGCC 

CCGGAGCCAT CAC&fATGCC TTATGT|CCT TACCCCCGGT CTACCGAGGT CTATGTCCGA 

\ 1 , 

TCGATCTTCG GCCCGGGTGG CACCCCtIcC TTATTCCCAA CCTCATGCTC CACTAAGTCG 
ACCTTCCATG CTGTCCCTGC CCATATTTGfi GACCGTCTTA TGCTGTTCGG GGCCACCTTG 
GATGACCAAG CCTTTTGCTG CTCCCGTT nAMTGACCTACC TTCGCGGCAT TAGCTACAAG 
GTCACTGTTG GTACCCTTGT GGCTAATGAA &CTGGAATG CCTCTGAGP* CGCCCTCACA 
GCTGTTATCA CTGCCGCCTA CCTTACCATtHgVcACCAGC GGTATCTCCG CACCCAGGCT 
ATATCCAAGG GGATGCGTCG TCTGGAACGG gIgCATGCCC AGAAGTTTAT AACACGCCTC 
TACAGCTGGC TCTTCGAGAA GTCCGGCCGT gItTACATCC CTGGCCGTCA GTTGGAGTTC 
TACGCCCAGT GCAGGCGCTG GCTCTCCGCC GG|Tt\catC TTGATCCACG GGTGTTGGTT 
TTTGACGAGT CGGCCCCCTG CCATTGTAGG ACCGCGATCC GTAAGGCGCT CTCAAAGTTT 
TGCTGCTTCA TGAAGTGGCT TGGTCAGGAG TGCACCTGCT TCCTTCAGCC TGCAGAAGGC 
GCCGTCGGCG ACCAGGGTCA TGATAATGAA GCCTATGAG& GGTCCGATGT TGACCCTGCT 
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777 
837 
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6AGTCCGCCA ijTTAGTGACAT ATCTGGGTCC TATGTCGTCC CTGGCACTGC CCTCCAACCG 

CTCTACCAG(L iSCCTCGATCT CCCCGCTGAG ATTGTGGCTC GCGCGGGCCG GCTGACCGCC 

ACAGTAAAGG VcTCCCAGGT CGATGGGCGG ATCGATTGCG AGACCCTTCT TGGTAACAAA 

ACCTTTCGCA COTCGTTCGT TGACGGGGCG GTCTTAGAGA CCAATGGCCC AGAGCGCCAC 

AATCTCTCCT TCGA\TGCCAG TCAGAGCACT ATGGCCGCTG GCCCTTTCAG TCTCACCTAT 

GCCGCCTCTG CAGCTGGGCT GGAGGTGCGC TATGTTGCTG CCGGGCTTGA CCATCGGGCG 

GTTTTTGCCC CCGGTGtVtC ACCCCGGTCA GCCCCCGGCG AGGTTACCGC CTTCTGCTCT 

GCCCTATACA GGTTTAACCGy TGAGGCCCAG CGCCATTCGC TGATCGGTAA CTTATGGTTC 

CATCCTGAGG GACTCATTGG CCJ«TO5e6s^CGTTTTCGC CCGGGCATGT TTGGGAGTCG 

GCTAATCCAT TCTGTGGCG^T GAG£ACACTT TACAQCCGTA CTTGGTCGGA GGTTGATGCC 

GTCTCTAGTC CAGCCCGMCC TGACTTAGGT TTTATG7£TG AGCCTTCTAT ACCTAGTAGG 



GCCGCCACGC CTACCCT 



5GC GGCCCdTCTA CCCCCCCCte CACCGGACCC TTCCCCCCCT 



CCCTCTGCCC CGGCGCTFGC TGAGCCGGCT TCTGGCGj&TMCCGCCGGGGC CCCGGCCATA 
ACTCACCAGA CGGCCCGiCA CCGCCGCP*S^TCTTCACCT ACCCGGATGG CTCTAAGGTA 

ttcgccggct cgctgttcq/TgtcgacatgcXacgtggctcg TTAACGCGTC TAATGTTGAC 

CACCGCCCTG GCGGCGGGCf TTGCCATGCA \tTTACCAAA GGTACCCCGC CTCCTTTGAT 

GCTGCCTCTT TTGTGATGC$ CGACGGCGCG GflfGCGTACA CA^TAACCCC CCGGCCAATA 

ATTCACGCTG TCGCCCCTGA 1 TTATAGGTTG GAACATAACC CAAAGAGGCT TGAGGCTGCT 

TATCGGGAAA CTTGCTCCCG CCTCGGCACC GCTQCATACC CGCTGCTCGG GACCGGCATA 

TACCAGGTGC CGATCGGCCC CAGTTTTGAC GCCtIgGAGC GGAACCACCG CCCCGGGGAT 

GAGTTGTACC TTCCTGAGCT TGCTGCCAGA TGGTTTIGAGG CCAATAGGCC GACCCGCCCG 

ACTCTCACTA TAACTGAGGA TGTTGCACGG ACAGCGWftTC TGGCCATCGA GCTTGACTCA 

GCCACAGATG TCGGCCGGGC CTGTGCCGGC TGTCGGgVVa CCCCCGGCGT TGTTCAGTAC 

CAGTTTACTG CAGGTGTGCC TGGATCCGGC AAGTCCCGlV CTATCACCCA AGCCGATGTG 

GACGTTGTCG TGGTCCCGAC GCGTGAGTTG CGTAATGCQtI GGCGCCGTCG CGGCTTTGCT 

GCTTTTACCC CGCATACTGC CGCCAGAGTC ACCCAGGGGC WcGGGTTGT CATTGATGAG 

gctccatccc tcccccctca cctgctgctg ctccacatgci aWgggccgc CACCGTCCAC 

CTTCTTGGCG ACCCGAACCA GATCCCAGCC ATCGACTTTG \AGtACGCTGG GCTCGTCCCC 



1617 
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GCCATCAGGC CCGA\CTTAGG CCCCACCTCC TGGTGGCATG TTACCCATCG CTGGCCTGCG 3297 

GATGTATGCG AGCT<VtCCG TGGTGCATAC CCCATGATCC AGACCACTAG CCGGGTTCTC 3357 

CGTTCGTTGT TCTGGG^TGA GCCTGCCGTC GGGCAGAAAC TAGTGTTCAC CCAGGCGGCC 3417 

AAGCCCGCCA ACCCCGGtfcTC AGTGACGGTC CACGAGGCGC AGGGCGCTAC CTACACGGAG 3477 

ACCACTATTA TTGCCACAGi AGATGCCCGG GGCCTTATTC AGTCGTCTCG GGCTCATGCC 3537 

ATTGTTGCTC TGACGCGCCA^CACTGAGAAG TGCGTCATCA TTGACGCACC AGGCCTGCTT 3597 

CGCGAGGTGG GCATCTCCGA \\cAATCGTT AATAACTTTT TCCTCGCTGG TGGCGAAATT 3657 

GGTCACCAGC GCCCATCAGT T^CCCCGT GGCAACCCTG ACGCCAATGT TGACACCCTG 3717 

GCTGCCTTCC CGCCGTCTTG CC*UT TAGT GCCTTCCATC AGTTGGCTGA GGAGCTTGGC 3777 

CACAGACCTG TCCCTGTTGC AGCTf^CTA>S4CCCTGCC CCGAGCTCGA ACAGGGCCTT 3837 

CTCTACCTGC CCCAGGAGCT CCA&ClteT GATAGtGTCG TAACATTTGA ATTAACAGAC 3897 

ATTGTGCACT GCCGCATGGC /cGCCCCQAg\ CAGCGCAA&G CCGTGCTGTC CACACTCGTG 3957 

GGCCGCTACG GCGGTCGCACJ AAAGCTCt\c v^ATGCTTCC(\aCTCTGATGT TCGCGACTCT 4017 

CTCGCCCGTT TTATCCCGGClfcATTGGCCCft GWCAGGTIAs^S^GI^ATTGTACGAG 4077 

CTAGTGGAGG CCATGGTCGA ^AGGGCCAGj^CCTCCG CC&JCCTTGA GCTTGATCTT 4137 

TGCAACCGTG ACGTGTCCAG J^^OTC ^CCA&AAAG ATTGtAACAA GTTCACCACA 4197 

GGTGAGACCA TTGCCCATGG TAARGTGGGC CaIggQATCT CGGCC1GGAG CAAGACCTTC 4257 

TGCGCCCTCT TTGGCCCTTG GTICCGCGCT ATTGMAWGG CTATTCTSGC CCTGCTCCCT 4317 

CAGGGTGTGT TTTACGGTGA TGCOTFTGAT GACAC^TOT TCTCGGCGGC TGTGGCCGCA 4377 

GCAAAGGCAT CCATGGTGTT TGAGAATGAC TTTTCTGAG1 TTGACTCCAC CCAGAATAAC 4437 

TTTTCTCTGG GTCTAGAGTG TGCTATTATG GAGGAGTGTGAgGATGCCGCA GTGGCTCATC 4497 

CGCCTGTATC ACCTTATAAG GTCTGCGTGG ATCTTGCAGG &CCCGAAGGA GTCTCTGCGA 4557 

GGGTTTTGGA AGAAACACTC CGGTGAGCCC GGCACTCTTC tAtGGAATAC TGTCTGGAAT 4617 

ATGGCCGTTA TTACCCACTG TTATGACTTC CGCGATTTTC AG&TGGCTGC CTTTAAAGGT 4677 

GATGATTCGA TAGTGCTTTG CAGTGAGTAT CGTCAGAGTC CAGGjAGCTGC TGTCCTGATC 4737 

GCCGGCTGTG GCTTGAAGTT GAAGGTAGAT TTCCGCCCGA TCGGWTGTA TGCAGGTGTT 4797 

GTGGTGGCCC CCGGCCTTGG CGCGCTCCCT GATGTTGTGC GCTTCGlCGG CCGGCTTACC 4857 

GAGAAGAATT GGGGCCCTGG CCCTGAGCGG GCGGAGCAGC TCCGCCTTGC TGTTAGTGAT 4917 
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TTCCTCCGCA AGCTCACGAA TGTAGCTCAG ATGTGTGTGG ATGTTGTTTC CCGTGTTTAT 4977 

GGGGTTTCCCXCTGGACTCGT TCATAACCTG ATTGGCATGC TACAGGCTGT TGCTGATGGC 5037 

AAGGCACATT TCACTGAGTC AGTAAAACCA GTGCTCGACT TGACAAATTC AATCTTGTGT 5097 

CGGGTGGAAT GaYtaaCATGTC TTTTGCTGCG CCCATGGGTT CGCGACCATG 5149 

CGCCCTCGGC CTATt\tGTT GCTGCTCCTC ATGTTTTTGC CTATGCTGCC CGCGCCACCG 5209 

CCCGGTCAGC CGTCTGGOfG CCGTCGTGGG CGGCGCAGCG GCGGTTCCGG CGGTGGTTTC 5269 

TGGGGTGACC GGGTTGATT& TCAGCCCTTC GCAATCCCCT ATATTCATCC AACCAACCCC 5329 

TTCGCCCCCG ATGTCACCGC \gCGGCCGGG GCTGGACCTC GTGTTCGCCA ACCCGCCCGA 5389 

CCACTCGGCT CCGCTTGGCG TGACCAGGCC CAGCGCCCCG CCGTTGCCTC ACGTCGTAGA 5449 

CCTACCACAG CTGGGGCCGC GCCgVtaA CCGCGGTCGC TCCGGCCCAT GACACCCCGC 5507 

CAGTGCCTGA TGTCGACTCC CGC^ckcCA ThjGCGCCG GCAGTATAAC CTATCAACAT 5567 

CTCCCCTTAC CTCTTCGGTG flfCCACCGGtA CTAACkTGGT TCTTTATGCC GCCCCTCTTA 5627 

I / \ \ 

GTCCGCTTTT ACCCCTTCAff GACGGCACcA ATACCCaW AATGGCCACG GAAGCTTCTA 5687 

ATTATGCCCA GTACCGGGW GCCCGTGCCA WtCCGJJA CCGCCCGCTG GTCCCCAATG 5747 

CTGTCGGCGG TTACGCCAJCjl^^ ACCCCGACGT 5807 

CCGTTGATAT GAATTCAATM ACCTCGACGG ATGTTCGTAT ITTAGTCCAG CCCGGCATAG 5867 

CCTCTGAGCT TGTGATCCcAlAGTGAGCGCC TACAOTATCG TAACCAAGGC TGGCGCTCCG 5927 

TCGAGACCTC TGGGGTGGCtIgaGGAGGAGG CTACCTTTGG TO|TGTTATG CTTTGCATAC 5987 

ATGGCTCACT CGTAAATTCC TATACTAATA CACCCTATAC CGGTGCCCTC GGGCTGTTGG 6047 

ACTTTGCCCT TGAGCTTGAG TTTCGCAACC TTACCCCCGG TAACACCAAT ACGCGGGTCT 6107 

CCCGTTATTC CAGCACTGCT CGCCACCGCC TTCGTCGCGG\ TGCGGACGGG ACTGCCGAGC 6167 

TCACCACCAC GGCTGCTACC CGCTTTATGA AGGACCTCTA 1TTTACTAGT ACTAATGGTG 6227 

TCGGTGAGAT CGGCCGCGGG ATAGCCCTCA CCCTGTTCAA CCTTGCTGAC ACTCTGCTTG 6287 

GCGGCCTGCC GACAGAATTG ATTTCGTCGG CTGGTGGCCA GCTftTTCTAC TCCCGTCCCG 6347 

TTGTCTCAGC CAATGGCGAG CCGACTGTTA AGTTGTATAC ATCT^TAGAG AATGCTCAGC 6407 

AGGATAAGGG TATTGCAATC CCGCATGACA TTGACCTCGG AGAATCtCGT GTGGTTATTC 6467 

AGGATTATGA TAACCAACAT GAACAAGATC GGCCGACGCC TTCTCCA6CC CCATCGCGCC 6527 

CTTTCTCTGT CCTTCGAGCT AATGATGTGC TTTGGCTCTC TCTCACCGOT GCCGAGTATG 6587 
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ACCAGTCCAC TTATGGCTCT TCGACTGGCC CAGTTTATGT TTCTGACTCT GTGACCTTGG 

TTAATGTTGG GACCGGCGCG CAGGCCGTTG CCCGGTCGCT CGATTGGACC AAGGTCACAC 

ttgacggtcg\ccccctctcc ACCATCCAGC AGTACTCGAA GACCTTCTTT GTCCTGCCGC 

TCCGCGGTAA GtTCTCTTTC TGGGAGGCAG GCACAACTAA AGCCGGGTAC CCTTATAATT 
\ 

ATAACACCAC TGtoAGCGAC CAACTGCTTG TCGAGAATGC CGCCGGGCAC CGGGTCGCTA 
TTTCCACTTA CACOXtTAGC CTGGGTGCTG GTCGCGTCTC CATTTCTGCG GTTGCCGTTT 
TAGCCCCCCA CTCTQCfeCTA GCATTGCTTG AGGATACCTT GGACTACCCT GCCCGCGCCC 
ATACTTTTGA TGATTTCTGC CCAGAGTGCC GCCCCCTTGG CCTTCAGGGC TGCGCTTTCC 
AGTCTACTGT CGCTGAg\t\ CAGCGCCTTA AGATGAAGGT GGGTAAAACT CGGGAGTTGT 
AG TTTATTTGCT TGTGC(\CC\C IT C TT JjlX^LJG C TT ATTTC TCATTTCTGC 
GTTCCGCGCT CCCTGA 

a fourth sequerfce Y( SEQ ID NO.&D): 
GCCATGGAGG CCCACCaWt/ CATTaXgGCT CCTGGCATCA CtXcTGCTAT TGAGCAAGCA 



GCTCTAGCAG CGGCCAAAfG CGQCCTTiCG AATGCTGTGG, 



£GGCC tttcctttcc 



CATCAGCAGG TTGAGATCDH TATAR ATCJ 



GOTfCCTC GGCAGC nGGT GTTTCGTCCT 



GAGGTTTTTT GGAATCAC^gM>TtAACGT\ GTTATACATA ATGAGCTTGW GCAGTATTGC 
CGTGCTCGCT CGG&?^rG \| CTTGA^ATT \gAGCCCACC CACGCTCCaA TAATGATAAT 
CCTAATGTCC TCCATCGCTGWTTCTCCAC CaCGTCGGCC GGGAJGTTCA, QCGCTGGTAC 
ACAGCCCCGA CTAGGGGACC \pCGGCGAACyGTfcGCCGCT CGGCACTTCG TGGTCTGCCA 
CCAGCCGACC GCACTTACTG TTTTGATGGC tVtgVcGGCT GCCGTTTTGC CGCCGAGACT 
GGTGTGGCTC TCTATTCTCT CCATGACTTG CAGSxVsCTG ATGTTGCCGA GGCGATGGCT 
CGCCACGGCA TGACCCGCCT TTATGCAGCT TTCCAG^TGC CTCCAGAGGT GCTCCTGCCT 
CCTGGCACCT ACCGGACATC ATCCTACTTG CTGATCC^CG ATGGTAAGCG CGCGGTTGTC 
ACTTATGAGG GTGACACTAG CGCCGGTTAC AATCATGATG TTGCCACCCT CCGCACATGG 
ATCAGGACAA CTAAGGTTGT GGGTGAACAC CCTTTGGTGa\cGAGCGGGT GCGGGGTATT 
GGCTGTCACT TTGTGTTGTT GATCACTGCG GCCCCTGAGC c\tCCCCGAT GCCCTACGTT 

ccttacccgc gttcgacgga ggtctatgtc cggtctatct ttcIggcccgg CGGGTCCCCG 

TCGCTGTTCC CGACCGCTTG TGCTGTCAAG TCCACTTTTC ACGCCGTCCC CACGCACATC 
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TGGGACCGTC 
CTTATGACGT 
GAAGGCTGGA 
ATATGTCATC 
CTTGAACATG 
CGTGATTACA 
GCCGGGTTCC 
CGAACCACCA 
GAGTGTTCTT 
GAGGCCTATG 
TCATACATCG 
GACCTGGTAG 
CGTCTGGATT 
GCACGCCTTG 
AGTATGGCAG 
CATTTTTCCA 
GCCCCGCCGA 
CAGCGCCAGT 
CCGCCCTTTT 
CTCTACACCC 
GGTCATTTGG 
GTAGGCTCGT 
CCCTCTGGGG 
CACACCTACC 
TGGCTTGTCA 
TTTCAGCGTT 
GCGTATACCC 
CATAACCCCA 



iTCATGCTCTT TGGGGCCACC CTCGACGACC AGGCCTTTTG CTGCTCCAGG 
AtCTTCGTGG CATTAGCTAT AAGGTAACTG TGGGTGCCCT GGTCGCTAAT 
ATGCCACCGA GGATGCGCTC ACTGCAGTTA TTACGGCGGC TTACCTCACA 
AGCOTTATTT GCGGACCCAG GCGATTTCTA AGGGCATGCG CCGGCTTGAG 
CTCAGAAATT TATTTCACGC CTCTACAGCT GGCTATTTGA GAAGTCAGGT 
tcccag\ccg CCAGCTGCAG TTCTACGCTC agtgccgccg CTGGTTATCT 
ATCTCGACCC CCGCACCTTA GTTTTTGATG AGTCAGTGCC TTGTAGCTGC 
TCCGGCGGA \ CGCTGGAAAA TTTTGCTGTT TTATGAAGTG GCTCGGTCAG 
GTTTCCTCCA GCCCGCCGAG GGGCTGGCGG GCGACCAAGG TCATGACAAT 
AAGGCTCTGA TGfTGATACT GCTGAGCCTG CCACCCTAGA CATTACAGGC 
TGGATGGTCG GTCTCT&e£A ACTGTCTXTQ. AAGCTCTCGA CCTGCCAGCT 
CTCGCGCAGC CCSaAgTCT GCTACAGTTA CTfitfTACTGA AACCTCTGGC 



GCCAAACAAT 



ATCGGCAAT AAGACTTTTC TCACTACCTT TGTTGATGGG 



AGGTTAACgG GCCTGAGCAG CTTAACCTCT CTTTTGAOAG 



CAGTGT 



CCGGCCCGTti T^TGCCTCACC \TATGCTGCCG JM#TGGCG& GCTGGAAGTT 
CCGCTGGCCTlCtiiAGAGCCGT^fi^TdTTTTCC CCCCWAA \gCCCCGACT 
GTGAGGTCAC \(?6CTTCTGC TCAGCTCTTT ATAGGCACAA CC&GCAGAGC 
CGGTTATTGG TAGtTTGTGG CTGOACCCTG AAGGTTTGCT CGGaCTGTTC 
CACCCGGGCA TGAGIGGCGG TCTGCTAACC CATTTTGCGG CGAGAGCACG 
GCACTTGGTC CACAAfTACA GACACA&CCT TAACTGTCGG GCTAATTTCC 
ATGCTGCTCC CCACTCGGGG GGGCCACCfG CTACTGCCAC AGGCCCTGCT 
CTGACTCTCC AGACCCTGAC CCGCTACCTG ATGTTACAGA TGGCTCACGC 
CCCGTCCGGC TGGCCCCAAC CCGAATGGCG VtCCGCAGCG CCGCTTACTA 
CTGACGGCGC TAAGATCTAT GTCGGCTCCA TTTTCGAGTC TGAGTGCACC 
ACGCATCTAA CGCCGGCCAC CGCCCTGGTG GC^GGCTTTG TCATGCTTTT 
ACCCTGATTC GTTTGACGCC ACCAAGTTTG TGATGCGTGA TGGTCTTGCC 
TTACACCCCG GCCGATCATT CATGCGGTGG CCCCGGACTA TCGATTGGAA 
AGAGGCTCGA GGCTGCCTAC CGCGAGACTT GCGCCCGCCG AGGCACTGCT 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
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TCAGTATTCT CTGcVgCCGT GGCTGGCGCC AGCCATGCCA TGGTGTTTGA AAATGATTTT 4380 

TCTGAGTTTG ACTCGIftCTCA GAATAACTTT TCCCTAGGTC TTGAGTGCGC CATTATGGAA 4440 

GAGTGTGGTA TGCCCcVgTG GCTTGTCAGG TTGTACCATG CCGTCCGGTC GGCGTGGATC 4500 

CTGCAGGCCC CAAAAGAGTC TTTGAGAGGG TTCTGGAAGA AGCATTCTGG TGAGCCGGGC 4560 

AGCTTGCTCT GGAATACGGT GTGGAACATG GCAATCATTG CCCATTGCTA TGAGTTCCGG 4620 

GACCTCCAGG TTGCCGCCTI CAAGGGCGAC GACTCGGTCG TCCTCTGTAG TGAATACCGC 4680 

CAGAGCCCAG GCGCCGGTTcWtTATAGCA GGCTGTGGTT TGAAGTTGAA GGCTGACTTC 4740 

15 CGGCCGATTG GGCTGTATGC CGGGGTTGTC GTCGCCCCGG GGCTCGGGGC CCTACCCGAT 4800 

GTCGTTCGAT TCGCCGGACG GcVtTCGGAG AAGAACTGGG GGCCTGATCC GGAGCGGGCA 4860 

GAGCAGCTCC GCCTCGCCGT GCAGGATTTC CTCCGTAGGT TAACGAATGT GGCCCAGATT 4920 

20 \ 

TGTGTTGAGG TGGTGTCTAG AGTTfAC^G^fT^CCCCGG GTCTGGTTCA TAACCTGATA 4980 

GGCATGCTCC AGACTATTQG TGATOSTAAG GCGCAT^A CAGAGTCTGT TAAGCCTATA 5040 

25 CTTGACCTTA CACACT|AAT TATGCAKCGG TCTGAATGAA^^ATGTGGJTFGCTGCGC 5100 

CCATGGGTTC GCCACCATGC ^ZZl^mXZJ^f^&^^^miZZlZl TGTTTCTGCC 5160 

TATGTTGCCC GCGCCA6CGA a^gJeAg^erfrGGCCGC CGTOGTGGGC GGCGCAGCGG 5220 

CGGTACCGGC GGTG^#O^GG^GTGACCa GGTTGATTCT CAGCOCTTCG CAATCCCCTA 5280 

TATTCATCCA ACCAACCcWi TTGCCCCAGA VgTTGCCGCT GCGTCCGGGT CTGGACCTCG 5340 

4; 35 CCTTCGCCAA CCAGCCCGg\ |cACTTGGCTC c\dTGGCGA GATCAGGCCC AGCGCCCCTC 5400 

CGCTGCCTCC CGTCGCCGAC "CTGCCACAGC CG&GGCTGCG GCGCTGAC&G CTGTGGCGCC 5460 



30 



40 



TGCCCATGAC ACCTCACCCG TCCCGGACGT TGA1TCTCGC GGTGCAATTC TACGCCGCCA 5520 

GTATAATTTG TCTACTTCAC CCCTGACATC CTCT&TGGCC TCTGGCACTA ATTTAGTCCT 5580 

GTATGCAGCC CCCCTTAATC CGCCTCTGCC GCTGCAGGAC GGTACTAATA CTCACATTAT 5640 

45 GGCCACAGAG GCCTCCAATT ATGCACAGTA CCGGGTTGCC CGCGCTACTA TCCGTTACCG 5700 

GCCCCTAGTG CCTAATGCAG TTGGAGGCTA TGCTATATTC ATTTCTTTCT GGCCTCAAAC 5760 

AACCACAACC CCTACATCTG TTGACATGAA TTCCATTAcV TCCACTGATG TCAGGATTCT 5820 
50 \ 

TGTTCAACCT GGCATAGCAT CTGAATTGGT CATCCCAAGQ GAGCGCCTTC ACTACCGCAA 5880 

TCAAGGTTGG CGCTCGGTTG AGACATCTGG TGTTGCTGAG \gAGGAAGCCA CCTCCGGTCT 5940 

55 TGTCATGTTA TGCATACATG GCTCTCCAGT TAACTCCTAT ACCAATACCC CTTATACCGG 6000 
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TGCCCTTGGC 1TACTGGACT TTGCCTTAGA GCTTGAGTTT CGCAATCTCA CCACCTGTAA 
CACCAATACA CGTGTGTCCC GTTACTCCAG CACTGCTCGT CACTCCGCCC GAGGGGCCGA 
CGGGACTGCG GAlSCTGACCA CAACTGCAGC CACCAGGTTC ATGAAAGATC TCCACTTTAC 
CGGCCTTAAT GGGGTAGGTG AAGTCGGCCG CGGGATAGCT CTAACATTAC TTAACCTTGC 
TGACACGCTC CTCOBCGGGC TCCCGACAGA ATTAATTTCG TCGGCTGGCG GGCAACTGTT 
TTATTCCCGC CCGGTTGTCT CAGCCAATGG CGAGCCAACC GTGAAGCTCT ATACATCAGT 
GGAGAATGCT CAGCAG&ATA AGGGTGTTGC TATCCCCCAC GATATCGATC TTGGTGATTC 
GCGTGTGGTC ATTCAGgAtT ATGACAACCA GCATGAGCAG GATCGGCCCA CCCCGTCGCC 
TGCGCCATCT CGGCCTTTW CTGTTCTCCG AGCAAATGAT GTACTTTGGC TGTCCCTCAC 
TGCAGCCGAG TATGACCAgA CCACTTACGG GTCGTCAACT GGCCCGGTTT ATATCTCGGA 
CAGCGTGACT TTGGTGAATG TTGCGACT6G CGCGCAGGCC GTAGCCCGAT CGCTTGACTG 
GTCCAAAGTC ACCCTCGACG G&CGGCCCCT CCCGACTGTT GAGCAATATT CCAAGACATT 
CTTTGTGCTC CCCCTTCGTG GC^GCTCTC^XJTTTGGGAG GCCGGCACAA CAAAAGCAGG 
TTATCCTTAT AATTATAATA QtA^TGCTAG TGACQGATT CTGATTGAAA ATGCTGCCGG 
CCATCGGGTC GCCATTiTCA^ OtTAltoCCAC CAGGCTlteGG GCCGGTCCGG TCGCCATTTC 

CACGCTCCGC CCTGGCTGfG CTGGAGGATA CTTTTGATTA 

\ \ ^ 

ttgatgaltt ctgccctgaWgccgcgctt TAGGCCTCCA 

CTGTCGCmA^GtTCCAGCGC\CTTAAAGTTA AGGTGGGTAA 
TTTGGCTGTb CCCACCTACT \aTATCTGCT GATTTCCTTT 

CGCTCCCTG\A 



TGCGGCCGCG GTTTTGGCT 
TCCGGGGCGG GCGCACACAjT 
GGGTTGTGCT TTCCAGTCA, 
AACTCGGGAG TTGTAGTffJ 
ATTTCCTTTT TCTCGGTCCC 



or a fifth sequence |SEQ ID1N0-1 2 ) : 

CGGGCCCCGT ACAGGTCACA |CCTGTGAGT TyrACGAGCT AGTGGAGRCC ATGGTCGAGA 

AAGGCCAGGA TGGCTCCGCC GTCCTTGAGC TCSATCTCTG CAACCGTGAC GTGTCCAGGA 

TCACCTTTTT CCAGAAAGAT TGCAATAAGT TCACCACGGG AGAGACCATC GCCCATGGTA 

AAGTGGGCCA GGGCATTTCG GCCTGGAGTA AGACCTTCTG TGCCCTTTTC GGCCCCTGGT 

TCCGTGCTAT TGAGAAGGCT ATTCTGGCCC TGCTCCCTCA GGGTGTGTTT TATGGGGATG 

CCTTTGATGA CACCGTCTTC TCGGCGCGTG TGGCCGCAGC AAAGGCGTCC ATGGTGTTTG 

AGAATGACTT TTCTGAGTTT GACTCCACCC AGAATAATTT TTCCCTGGGC CTAGAGTGTG 

CTATTATGGA GAAGTGTGGG ATGCCGAAGT GGCTCATCCG CTTGTACCAC CTTATAAGGT 



6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7171 

60 
120 
180 
240 
300 
360 
420 
480 
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CTGCGTGGftT CCTGCAGGCC CCGAAGGAGT CCCTGCGAGG GTGTTGGAAG AAACACTCCG 540 

GTGAGCCCG^ CACTCTTCTA TGGAATACTG TCTGGAACAT GGCCGTTATC ACCCATTGTT 600 

ACGATTTCCG fcGATTTGCAG GTGGCTGCCT TTAAAGGTGA TGATTCGATA GTGCTTTGCA 6G0 

GTGAGTACCG HAGAGTCCA GGGGCTGCTG TCCTGATTGC TGGCTGTGGC TTAAAGCTGA 720 

AGGTGGGTTT CCGICCGATT GGTTTGTATG CAGGTGTTGT GGTGACCCCC GGCCTTGGCG 780 

CGCTTCCCGA CGTCG^GCGC TTGTCCGGCC GGCTTACTGA GAAGAATTGG GGCCCTGGCC 840 

CTGAGCGGGC GGAGCA&CTC CGCCTTGCTG TGCG 874 



or a sequence complementary thereto. 



4. A protein\ which is (a) immunoreactive with 
antibodies present in individuals infected with 

20 enterically transmitted nonA/nonB hepatitis and (b) 
derived from a viral hepatitis agent whose genome 
contains a region whick\ is homologous to the 1.33 kb 
DNA EcoRI insert preserft^n^plasmid pTZXFl ( ET1 . 1 ) 
carried in coli strain BB4/\and having ATCC 

25 Deposit Nno. 67717 



U 



of claim 4, Which i^ encoded by 
n saiMl.33 kp ExsolCl insert. 



5 . The prote j 
a coding region wit! 



30 6. A protein ^li^h^is \ ( a ) im&mnoreactive with 

antibodies present iri\ individuals infected with 
enterically transmitted nonA/i|onB hepatitis and (b) 
encoded by genetic sequence 40^.3-2 or 406*4-2 or a 
fragment thereof. 

35 

7. A method of detecting infection by 
enterically transmitted nonA/nonB hepatitis viral 
agent in a test individual, comprising: 

providing a peptide antAgen which is (a) 
40 immunoreactive with antibodies present in individuals 
infected with enterically transmitted nonA/nonB 
hepatitis and (b) derived from a viral hepatitis agent 
whose genome contains a region which Vs homologous to 
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the 1,33 kb DnW EcoRI insert present in plasmid 
pTZKFl ( ET1 . 1 ) carried in E^ coli strain BB4 , and 
having ATCC deAosit no. 67717, 

reacting serum from the test individual with 

such antigen, an\ 

examining the antigen for the presence of 

bound antibody, 
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8« The method of claim 7, wherein the serum 
antibody is an IgM \r IgG antibody, or a mixture of 
both, the antigen prbvided is attached to a support , 
said reacting includes contacting such serum with the 
support and said examining includes reacting the 
support and bound seruki antibody with a reporter- 
labeled anti-human anti)bod^ 



9 • A kit for a&c 
antibodies which are 
transmitted nonA/nonfi 
a support w' 
peptide antigen which 
antibodies present i~n 




taining, the Presence of serum 
diagnostic of enAerlpally 
hepatitis infection, comprising 
th s\irf ap^e^^und \recombinant 
xs^tnf immunbreactive with 
ndivlduals infecqed with 



enterically transmitted non^/nonB vliral hepatitis 

agent and (b) derived Mrom a \ viral hepatitis agent 

whose genome contains akregioYi which is homologous to 

the 1,33 kb DNA EcoRI insert Aresent in plasmid 

pTZKFl(ETl* 1) carried in coli strain BB4, and 

having ATCC deposit no, 67717, Wd 

a reporter-labeled anti-human antibody* 

* \ 

10* A DNA fragment derived krom an enterically 
transmitted nonA/nonB viral hepatitis agent whose 
genome contains a region which is homologous to the 
1.33 kb DNA EcoRI insert present in\ plasmid 
pTZKFl ( ET1 • 1 ) carried in E^_ coli strain BB4 and having 
ATCC deposit no. 67717. 
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11. The fragment of claim 10, which is derived 
from said 1.33 kb WoRI insert. 

12. A DNA molecule comprising genetic sequence 
406.3-2 or 4 06.4-2 dp a fragment thereof, wherein said 
fragment comprises an least 12 consecutive 
nucleotides . 
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13. A DNA fragment derived from an enterically 
transmitted nonA/nonB ^iral hepatitis agent whose 

genome contains a regioh which is homologous to a DNA 
fragment within a first \sequence (SEQ ID NO.l): 

AGACCTGTCC CTGTTGCAGC TGTTCTaVcA CCCTGCCCCG AGCTCGAACA GGGCCTTCTC 60 

TACCTGCCCC AGGAGCTCAC CACCTGTGAT AGTGTCGTAA CATTTGAATT AACAGACATT 120 

GTGCACTGCC GCATGGCCGC CCCGAGCCA& CGCAAGGCCG TGCTGTCCAC ACTCGTGGGC 180 

CGCTACGGCG GTCGCACAAA GCTCTACAA^TTCCCAfeT CTGATGTTCG CGACTCTCTC 240 

GCCCGTTTTA TCCCGGCCAT TGGCqCC^TA O/VGGTTACA^ CTTGTGAATT GTACGAGCTA 300 

GTGGAGGCCA TGGTCGAGAA GGGCCJ^GAT G^jCTCCGCCG \CCJJGAfiCT TGATCTTTGC 360 

AACCGTGACG TGTCCAGGAT CACCTT^TTC CA^Mt^TT GT^ACAAGTT CACCACAGGT 420 

GAGACCATTG CCCATGGTAA AGTG£Gfft5AG GGCA\CTCGG CCT^GAGCAA GACCTTCTGC 480 

GCCCTCTTTG GCCCTTGGTT CCGCGdVlT GAGAA&GCTA TTCT&GCCCT GCTCCCTCAG 540 

GGTGTGTTTT ACGGTGATGC CTTTGATsW ACCGTC1TCT CGGCGGCTGT GGCCGCAGCA 600 

AAGGCATCCA TGGTGTTTGA GAATGACTTT TCTGAGtItG ACTCCACCCA GAATAACTTT 660 

TCTCTGGGTC TAGAGTGTGC TATTATGGAG GAGTGTGGGA TGCCGCAGTG GCTCATCCGC 720 

CTGTATCACC TTATAAGGTC TGCGTGGATC TTGCAGGCCQ CGAAGGAGTC TCTGCGAGGG 780 

TTTTGGAAGA AACACTCCGG TGAGCCCGGC ACTCTTCTAT GGAATACTGT CTGGAATATG 840 

GCCGTTATTA CCCACTGTTA TGACTTCCGC GATTTTCAGG TlaGCTGCCTT TAAAGGTGAT 900 

GATTCGATAG TGCTTTGCAG TGAGTATCGT CAGAGTCCAG GAGCTGCTGT CCTGATGGCC 960 

GGCTGTGGCT TGAAGTTGAA GGTAGATTTC CGCCCGATCG GTTTGTATGC AGGTGTTGTG 1020 

GTGGCCCCCG GCCTTGGCGC GCTCCCTGAT GTTGTGCGCT TCGCCGGCCG GCTTACCGAG 1080 

AAGAATTGGG GCCCTGGCCC TGAGCGGGCG GAGCAGCTCC GCCT^GCTGT TAGTGATTTC 1140 

96 
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CTCCGCAAGC TCACGAATGT AGCTCAGATG TGTGTGGATG TTGTTTCCCG TGTTTATGGG 

GTTTCCCCTG GACKGTTCA TAACCTGATT GGCATGCTAC AGGCTGTTGC TGATGGCAAG 

GCACATTTCA CTGAGTICAGT AAAACCAGTG CTCGA 

a second sequence (SEQ ID NO-5): 

TCGAGCACTG GTTTTACTGA CTCAGTGAAA TGTGCCTTGC CATCAGCAAC AGCCTGTAGC 

ATGCCAATCA GGTTATGAA& GAGTCCAGGG GAAACCCCAT AAACACGGGA AACAACATCC 

ACACACATCT GAGCTACATT VgTGAGCTTG CGGAGGAAAT CACTAACAGC GAGGCGGAGC 

TGCTCCGCCC GCTCAGGGCC AGGGCCCCAA TTCTTCTCGG TAAGCCGGCC GGCGAAGCGC 

ACAACATCAG GGAGCGCGCC AAQGCCGGGG GCCACCACAA CACCTGCATA CAAACCGATC 

GGGCGGAAAT CTACCTTCAA CTTQAAGCCA CAGCCGGCGA TCAGGACAGC AGCTCCTGGA 

CTCTGACGAT ACTCACTGCA AAGCffCTATC GAATCATCAC CTTTAAAGGC AGCCACCTGA 

AAATCGCGGA AGTCATAACA GTGGGTAATA ACGGCCATAT TCCAGACAGT ATTCCATAGA 

AGAGTGCCGG GCTCACCGGA GTGTTTcW CAAAACCCTC GCAGAGACTC CTTCGGGGCC 

TGCAAGATCC ACGCAGACCT TATAAp^TGA TAC^GCGGA TGAGCCACTG CGGCATCCCA 

CACTCCTCCA TAATAGCACA CTt^AGACCQ AGAGAA&AGT TATTCTGGGT GGAGTCAAAC 

TCAGAAAAGT CATTCTCAAA QACCATGGAT \aCCTTTGOjG CGGCC££A6C CGCCGAGAAG 



ACGGTGTCAT CAAAGGCATC 



CCGTAAAAC A^ACCCTG^CGGAGCAGGGC CAGAATAGCC 
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TTCTCAATAG CGCGGAACCA J^GGGCCAAAG^EGCGCAGA ^GGTCTTGCT CCAGGCCGAG 

ATGCCCTGGC CCACTTTACC JjOMfiCAATG GTC\lCACCTG T^GTGAACTT GTTACAATCT 

TTCTGGAAGA AGGTGATCCT GGACACGTCA CGG#GCAAA GAiCAAGCTC AAGGACGGCG 

GAGCCATCCT GGCCCTTCTC GA|CATGGCC TCCACfAGCT CGTACAATTC ACAAGTTGTA 

ACCTGTACGG GGCCAATGGC CGGGATAAAA CGGGCGAGAG AGTCGCGAAC ATCAGAGTGG 

GAAGCATTGT AGAGCTTTGT GCGACCGCCG TAGCGGCCCA CGAGTGTGGA CAGCACGGCC 

TTGCGCTGGC TCGGGGCGGC CATGCGGCAG TGCACAATGT CTGTTAATTC AAATGTTACG 

ACACTATCAC AGGTGGTGAG CTCCTGGGGC AGGTAGAGA^ GGCCCTGTTC GAGCTCGGGG 

CAGGGTGGTA GAACAGCTGC AACAGGGACA GGTCT 

a third sequence (SEQ ID N0.6)\: 
AGGCAGACCA CATATGTGGT CGATGCC ATGGAGGCCC ATCASTTTAT TAAGGCTCCT 

GGCATCACTA CTGCTATTGA GCAGGCTGCT CTAGCAGCGG CCA^CTCTGC CCTGGCGAAT 

97 
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1260 
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120 
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300 
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480 
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600 
660 
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GCTGTGGTAG 
CAACCTCGCC 
ATCCATAACG 
GCCCATCCCC 
GTTGGGCGTG 
CGGCGTTCCG 
TCTGGCTGTA 
CCATCTGATG 
CATCTTCCGC 
ATTCATGACG 
CACGATGTCT 
CTCGTTATCG 
CCGGAGCCAT 
TCGATCTTCG 
ACCTTCCATG 
GATGACCAAG 
GTCACTGTTG 
GCTGTTATCA 
ATATCCAAGG 
TACAGCTGGC 
TACGCCCAGT 
TTTGACGAGT 
TGCTGCTTCA 
GCCGTCGGCG 
GAGTCCGCCA 
CTCTACCAGG 
ACAGTAAAGG 
ACCTTTCGCA 



TTAGGCCTT l TCTCTCTCAC 
AGCTTGTTTlj CCGCCCCGAG 
AGCTGGAGCT\ TTACTGCCGC 
GCTCAATAAA \TGATAATCCT 
ATGTTCAGCG c\rGGTATACT 
CGCTGCGCGG GCTTCCCGCT 
ACTTTCCCGC CGAGACTGGC 
TCGCCGAGGC CATG1TCCGC 
CTGAGGTCCT GCTGCQCCCT 
GTAGGCGCGT TGTGGTgWg 
CCAACTTGCG CTCCTGGAfT 
AGCGGGTTAG GGCCATT^Gi 
CACCTATGCC tVat/ttCCT 
GCCCGGGTGG CAtCCCTTCC 
CTGTCCCTGC CCATATTTGJ 



CCTTTTGCTG CU 



rGTTTA 



GTACCCTTGT GGCTOATGAA 

CTGCCGCCTA CCTTACCATT 

GGATGCGTCG TCTGGAACGG 

TCTTCGAGAA GTCCGGCCGT 

GCAGGCGCTG GCTCTCCGCC 

CGGCCCCCTG CCATTGTAGG 

TGAAGTGGCT TGGTCAGGAG 

ACCAGGGTCA TGATAATGAA 

TTAGTGACAT ATCTGGGTCC 

CCCTCGATCT CCCCGCTGAG 

TCTCCCAGGT CGATGGGCGG 

CGTCGTTCGT TGACGGGGCG 
98 



CAGCAGATTG AGATCCTCAT TAACCTAATG 177 

GTTTTCTGGA ATCATCCCAT CCAGCGTGTC 237 

GCCCGCTCCG GCCGCTGTCT TGAAATTGGC 297 

AATGTGGTCC ACCGCTGCTT CCTCCGCCCT 357 

GCTCCCACTC GCGGGCCGGC TGCTAATTGC 417 

GCTGACCGCA CTTACTGCCT CGACGGGTTT 477 

ATCGCCCTCT ACTCCCTTCA TGATATGTCA 537 

CATGGTATGA CGCGGCTCTA TGCCGCCCTC 597 

GGCACATATC GCACCGCATC GTATTTGCTA 657 

TATGAGGGTG ATACTAGTGC TGGTTACAAC 717 

AgMfiCACCA AGGTTACCGG AGACCATCCC 777 

TGCCACTTO TTCTCTTGCT CACGGCAGCC 837 

^TACCCCCGGt\tAC£6A6GT CTATGTCCGA 897 

yATTCO&AScaTCATGCTC CACTAAGTCG 957 

fttCGTCTTA TGGTGTTCGG GGCCACCTTG 1017 

ATg\cCTACC TTCGiCGGCAT TAGCTACAAG 1077 

GGCTfiGAATG CCTCjTGAGGA CGCCCTCACA 1137 

TGCCA^CAGC GGTATCTCCG CACCCAGGCT 1197 

GAGCAT^CCC AGAAGTTTAT AACACGCCTC 1257 

G ATT AC AtC C CTGGCCGTCA GTTGGAGTTC 1317 

GGCTTTCAl\C TTGATCCACG GGTGTTGGTT 1377 

ACCGCGATCC^ GTAAGGCGCT CTCAAAGTTT 1437 

TGCACCTGCT \TCCTTCAGCC TGCAGAAGGC 1497 

GCCTATGAGG (^GTCCGATGT TGACCCTGCT 1557 

TATGTCGTCC CtGGCACTGC CCTCCAACCG 1617 

ATTGTGGCTC GCiJCGGGCCG GCTGACCGCC 1677 

ATCGATTGCG AGA^CCTTCT TGGTAACAAA 1737 

GTCTTAGAGA CCAA^TGGCCC AGAGCGCCAC 1797 
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AATCTCTCCT TCGATjGCCAG TCAGAGCACT ATGGCCGCTG 
GCCGCCTCTG CAGCTGGGCT GGAGGT6CGC TATGTTGCTG 
GTTTTTGCCC CCGGTGTTTC ACCCCGGTCA GCCCCCGGCG 
GCCCTATACA GGTTTAAECG TGAGGCCCAG CGCCATTCGC 
CATCCTGAGG GACTCATTGG CCTCTTCGCC CCGTTTTCGC 
GCTAATCCAT TCTGTGGCGa\gAGCACACTT TACACCCGTA 
GTCTCTAGTC CAGCCCGGCC TGACTTAGGT TTTATGTCTG 

gccgccacgc ctaccctggc ggAccctcta CCCCCCCCTG 

CCCTCTGCCC CGGCGCTTGC TGAGCCGGCT TCTGGCGCTA 

ACTCACCAGA cggcccggca ccgccgcctg ctcttcacct 

TTCGCCGGCT CGCTGTTCGA GTCGACATGC ACGTGGCTCG 
CACCGCCCTG GCGGCGGGCT TTGCCATtp^OTACCAAA 
GCTGCCTCTT TTGTGATGCG CGACG^CGOG GCCGc\tACA 



ATTCACGCTG TCGCCCCTGA TT 



TATCGGGAAA CTTGCTCCCG CttCGGCACC &CTG* 



TACCAGGTGC CGATCGGCCC CAGiTTTTj 



TAGGTTG\GAACATAV\CC 




CjTGCCAGA TGGfTTGAG^ 
GCACGG ACAGCGAAT( 
GCCGGC TGTCGGGTCA) 



GAGTTGTACC TTCCTGAGCT 
ACTCTCACTA TAACTGAGGA TG^ 
GCCACAGATG TCGGCCGGGC CTG^ 
CAGTTTACTG CAGGTGT.GCC TGGATCCGGC AAGTCCfeGCT 
GACGTTGTCG TGGTCCCGAC GCGTGAGTTG CGTAATGl 
GCTTTTACCC CGCATACTGC CGCCAGAGTC ACCCAGGG( 
GCTCCATCCC TCCCCCCTCA CCTGCTGCTG CTCCACATGi 
CTTCTTGGCG ACCCGAACCA GATCCCAGCC ATCGACTTTG 
GCCATCAGGC CCGACTTAGG CCCCACCTCC TGGTGGCATG 
GATGTATGCG AGCTCATCCG TGGTGCATAC CCCATGATCC 
CGTTCGTTGT TCTGGGGTGA GCCTGCCGTC GGGCAGAAAC 
AAGCCCGCCA ACCCCGGCTC AGTGACGGTC CACGAGGCGC 

99 



GCCCTTTCAG TCTCACCTAT 1857 

CCGGGCTTGA CCATCGGGCG 1917 

AGGTTACCGC CTTCTGCTCT 1977 

TGATCGGTAA CTTATGGTTC 2037 

CCGGGCATGT TTGGGAGTCG 2097 

CTTGGTCGGA GGTTGATGCC 2157 

AGCCTTCTAT ACCTAGTAGG 2217 

CACCGGACCC TTCCCCCCCT 2277 

CCGCCGGGGC CCCGGCCATA 2337 

ACCCGGATGG CTCTAAGGTA 2397 

TTAACGCGTC TAATGTTGAC 2457 

GGTACCCCGC CTCCTTTGAT 2517 

CACTAACCCC CCGGCCAATA 2577 

AAAGAGGCT TGAGGCTGCT 2637 

CGCTCCTCGG GACCGGCATA 2697 

GGAACCACCG CCCCGGGGAT 2757 

CCAATAGGCC GACCCGCCCG 2817 

TGGCCATCGA GCTTGACTCA 2877 

CCCCCGGCGT TGTTCAGTAC 2937 

CTATCACCCA AGCCGATGTG 2997 

GGCGCCGTCG CGGCTTTGCT 3057 

GCCGGGTTGT CATTGATGAG 3117 

AGCGGGCCGC CACCGTCCAC 3177 

GCACGCTGG GCTCGTCCCC 3237 

TACCCATCG CTGGCCTGCG 3297 

A^ACCACTAG CCGGGTTCTC 3357 

TA^TGTTCAC CCAGGCGGCC 3417 

AGGSCGCTAC CTACACGGAG 3477 
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ACCACTATTA TTGG£ACAGC AGATGCCCGG GGCCTTATTC AGTCGTCTCG GGCTCATGCC 3537 

ATTGTTGCTC TGACGKjGCCA CACTGAGAAG TGCGTCATCA TTGACGCACC AGGCCTGCTT 3597 

CGCGAGGTGG GCATCTQCGA TGCAATCGTT AATAACTTTT TCCTCGCTGG TGGCGAAATT 3657 

GGTCACCAGC GCCCATCaVt TATTCCCCGT GGCAACCCTG ACGCCAATGT TGACACCCTG 3717 

GCTGCCTTCC CGCCGTCTTGV CCAGATTAGT GCCTTCCATC AGTTGGCTGA GGAGCTTGGC 3777 

CACAGACCTG TCCCTGTTGC A&CTGTTCTA CCACCCTGCC CCGAGCTCGA ACAGGGCCTT 3837 

CTCTACCTGC CCCAGGAGCT CAOCACCTGT GATAGTGTCG TAACATTTGA ATTAACAGAC 3897 

ATTGTGCACT GCCGCATGGC CGCCtCGAGC CAGCGCAAGG CCGTGCTGTC CACACTCGTG 3957 

GGCCGCTACG GCGGTCGCAC AAAGCTCTAC AATGCTTCCC ACTCTGATGT TCGCGACTCT 4017 

CTCGCCCGTT TTATCCCGGC CATTGGVCC GTACAGGTTA CAACTTGTGA ATTGTACGAG 4077 

CTAGTGGAGG CCATGGTCGA GAAGGGC(^G GATGGCTCCG CCGTCCTTGA GCTTGATCTT 4137 

TGCAACCGTG ACGTGTCCAG GAT^CCT"^ tYQCAGAAAG ATTGTAACAA GTTCACCACA 4197 

GGTGAGACCA TTGCCCATGG T#AAGTGGGC\.CA6Gftf ATCT CGGCCTGGAG CAAGACCTTC 4257 

TGCGCCCTCT TTGGCCCT^G G|TTCCGCGCT ^TTGAGM^tTATTCTGGC CCTGCTCCCT 4317 

CAGGGTGTGT TTTACGGTGA TfGCCTTTGAT G^'/fCCGT^T TCTCGGC6GG TGTGGCCGCA 4377 

GCAAAGGCAT CCATGGTGTT ^GAtftGAC TTITCTGAgA TTGACTCCAC CCAGAATAAC 4437 

TTTTCTCTGG GTCTAGAGTG T&CTATTATG GAGgWtGTG GGATGCCGCA GTGGCTCATC 4497 

CGCCTGTATC ACCTTATAAG GTftTGCGTGG ATCTT^CAGG CfCCGAAGGA GTCTCTGCGA 4557 

GGGTTTTGGA AGAAACACTC CG§TGAGCCC GGCACT(\TTC TATGGAATAC TGTCTGGAAT 4617 

ATGGCCGTTA TTACCCACTG TTATGACTTC CGCGATTTYC AGGTGGCTGC CTTTAAAGGT 4677 

GATGATTCGA TAGTGCTTTG CAGTGAGTAT CGTCAGAGT^ CAGGAGCTGC TGTCCTGATC 4737 

GCCGGCTGTG GCTTGAAGTT GAAGGTAGAT TTCCGCCCGA VCGGTTTGTA TGCAGGTGTT 4797 

GTGGTGGCCC CCGGCCTTGG CGCGCTCCCT GATGTTGTGC GCTTCGCCGG CCGGCTTACC 4857 

GAGAAGAATT GGGGCCCTGG CCCTGAGCGG GCGGAGCAGC TCCGCCTCGC TGTTAGTGAT 4917 

TTCCTCCGCA AGCTCACGAA TGTAGCTCAG ATGTGTGTGG ATGTTGTTTC CCGTGTTTAT 4977 

GGGGTTTCCC CTGGACTCGT TCATAACCTG ATTGGCATGC TACAGjGCTGT TGCTGATGGC 5037 

AAGGCACATT TCACTGAGTC AGTAAAACCA GTGCTCGACT TGACAAATTC AATCTTGTGT 5097 

CGGGTGGAAT GA ATAACATGTC TTTTGCTGCG CCCATGGGTT CGCG^CCATG 5149 
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CGCCCTCGGC CTATTTTGTit GCTGCTCCTC ATGTTTTTGC CTATGCTGCC CGCGCCACCG 5209 

CCCGGTCAGC CGTCTGGCC& CCGTCGTGGG CGGCGCAGCG GCGGTTCCGG CGGTGGTTTC 5269 

TGGGGTGACC GGGTTGATTcVfCAGCCCTTC GCAATCCCCT ATATTCATCC AACCAACCCC 5329 

TTCGCCCCCG ATGTCACCGC T^CGGCCGGG GCTGGACCTC GTGTTCGCCA ACCCGCCCGA 5389 

CCACTCGGCT CCGCTTGGCG TGACCAGGCC CAGCGCCCCG CCGTTGCCTC ACGTCGTAGA 5449 

CCTACCACAG CTGGGGCCGC GCCGCTAA CCGCGGTCGC TCCGGCCCAT GACACCCCGC 5507 

CAGTGCCTGA TGTCGACTCC CGCGGlCGCCA TCTTGCGCCG GCAGTATAAC CTATCAACAT 5567 

CTCCCCTTAC CTCTTCCGTG GCCACCGGCA CTAACCTGGT TCTTTATGCC GCCCCTCTTA 5627 

GTCCGCTTTT ACCCCTTCAG GACGGCA&CA ATACCCATAT AATGGCCACG GAAGCTTCTA 5687 

ATTATGCCCA GTACCGGGTT GCCCGTGCOA CAATCCGTTA CCGCCCGCTG GTCCCCAATG 5747 

CTGTCGGCGG TTACGCCATC TCCATCTCAti TCTGGCCACA GACCACCACC ACCCCGACGT 5807 

CCGTTGATAT GAATTCAATA ACCTCGACGG Vt^TCQTAT TTTAGTCCAG CCCGGCATAG 5867 

CCTCTGAGCT TGTGATCCCA AGTGAGCGC^TmCACTATC&VTAACCAAGGC TGGCGCTCCG 5927 

TCGAGACCTC TGGGGTGGCT GAg4aG^AGG CTACCTCTGG tW(^AYG CTTTGCATAC 5987 

ATGGCTCACT CGTAAATTCC TATAOTAATA CACGCTAT^rakGCCCTC GGGCTGTTGG 6047 

ACTTTGCCCT TGAGCTTGAG TTTC|CAACC^/£cCCCGG TAAOftCCAAT AC6CGGGTCT 6107 

CCCGTTATTC CAGCACTGCT CGCCMfGCC TTCGTCGCGG TGCGGACGGG ACTGCCGAGC 6167 

TCACCACCAC GGCTGCTACC CGCTtUatGA AGGACCKTA TTTTAGTAGT ACTAATGGTG 6227 

TCGGTGAGAT CGGCCGCGGG ATAGCCCTCA CCCTGTTCM CCTt@$!TGAC ACTCTGCTTG 6287 

GCGGCCTGCC GACAGAATTG ATTTCGTCGG CTGGTGGCCA GCTGTTCTAC TCCCGTCCCG 6347 

TTGTCTCAGC CAATGGCGAG CCGACTGTTA AGTTGTATAC\ATCTGTAGAG AATGCTCAGC 6407 

AGGATAAGGG TATTGCAATC CCGCATGACA TTGACCTCGG AGAATCTCGT GTGGTTATTC 6467 

AGGATTATGA TAACCAACAT GAACAAGATC GGCCGACGCC TTCTCCAGCC CCATCGCGCC 6527 

CTTTCTCTGT CCTTCGAGCT AATGATGTGC TTTGGCTCTC TCTCACCGCT GCCGAGTATG 6587 

ACCAGTCCAC TTATGGCTCT TCGACTGGCC CAGTTTATGT TTCTGACTCT GTGACCTTGG 6647 

TTAATGTTGC GACCGGCGCG CAGGCCGTTG CCCGGTCGCT CGATmGACC AAGGTCACAC 6707 

TTGACGGTCG CCCCCTCTCC ACCATCCAGC AGTACTCGAA GACCTTCTTT GTCCTGCCGC 6767 

TCCGCGGTAA GCTCTCTTTC TGGGAGGCAG GCACAACTAA AGCCGGG8TAC CCTTATAATT 6827 
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ATAACACCAC TGCTAG&AC CAACTGCTTG TCGAGAATGC CGCCGGGCAC CGGGTCGCTA 

TTTCCACTTA CACCACTAGC CTGGGTGCTG GTCCCGTCTC CATTTCTGCG GTTGCCGTTT 

TAGCCCCCCA CTCTGCGCTA GCATTGCTTG AGGATACCTT GGACTACCCT GCCCGCGCCC 

ATACTTTTGA TGATTTCTGOy CCAGAGTGCC GCCCCCTTGG CCTTCAGGGC TGCGCTTTCC 

AGTCTACTGT CGCTGAGCTT MGCGCCTTA AGATGAAGGT GGGTAAAACT CGGGAGTTGT 

AG TTTATTTGCT TGTGCCCCCC\TTCTTTCTGT TGCTTATTTC TCATTTCTGC 

GTTCCGCGCT CCCTGA 

a fourth sequence (SEQ iD NO. 10): 

GCCATGGAGG CCCACCAGTT CATTAAGGCT CCTGGCATCA CTACTGCTAT TGAGCAAGCA 

GCTCTAGCAG CGGCCAACTC CGCCCTTSCG AATGCTGTGG TGGTCCGGCC TTTCCTTTCC 

CATCAGCAGG TTGAGATCCT TATAAATCIC AT£CAAjQCTC GGCAGCTGGT GTTTCGTCCT 

GAGGTTTTTT GGAATCACCC GATTCAAGCTl GTTATACAtX ATGAGCTTGA GCAGTATTGC 

CGTGCTCGCT CGGGTCGCTG CC1TGAGATT GGAGCCCACC SACGCTCC^PKTAATGATAAT 

CCTAATGTCC TCCATCGCTG CTTfflCTCCAC C^GTCGGCC^SMTGTTCA GCGCTGGTAC 

ACAGCCCCGA CTAGGGGACC TGC4GCGAAC TGMcCGCT CGGCACTTCG TGGTCTGCCA 



CCAGCCGACC GCACTTACTG TTT 
GGTGTGGCTC TCTATTCTCT CCA 



SGC TTTGCCGGCT GCOGTTTTGC CGCCGAGACT 

3ACTTG CAGCCQGCTG ATGTTGCCGA GGCGATGGCT 

CGCCACGGCA TGACCCGCCT TTAT&CAGCT TTCCACTTGC CTCCAGAGGT GCTCCTGCCT 

CCTGGCACCT ACCGGACATC ATCCTACTTG CTGATCCAGG ATGGTAAGCG CGCGGTTGTC 

ACTTATGAGG GTGACACTAG CGCCGGTTAC AATCATGATg\ TTGCCACCCT CCGCACATGG 

ATCAGGACAA CTAAGGTTGT GGGTGAACAC CCTTTGGTGA TCGAGCGGGT GCGGGGTATT 

GGCTGTCACT TTGTGTTGTT GATCACTGCG GCCCCTGAGC COTCCCCGAT GCCCTACGTT 

CCTTACCCGC GTTCGACGGA GGTCTATGTC CGGTCTATCT TTGfeGCCCGG CGGGTCCCCG 

TCGCTGTTCC CGACCGCTTG TGCTGTCAAG TCCACTTTTC ACGCfcGTCCC CACGCACATC 

TGGGACCGTC TCATGCTCTT TGGGGCCACC CTCGACGACC AGGCGTTTTG CTGCTCCAGG 

CTTATGACGT ACCTTCGTGG CATTAGCTAT AAGGTAACTG TGGGTQCCCT GGTCGCTAAT 

GAAGGCTGGA ATGCCACCGA GGATGCGCTC ACTGCAGTTA TTACGGCGGC TTACCTCACA 

ATATGTCATC AGCGTTATTT GCGGACCCAG GCGATTTCTA AGGGCATpCG CCGGCTTGAG 
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6947 
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7067 
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7179 
7195 

60 
120 
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CTTGAACATG CTOAGAAATT TATTTCACGC CTCTACAGCT GGCTATTTGA GAAGTCAGGT 1260 

CGTGATTACA TCCcVgGCCG CCAGCTGCAG TTCTACGCTC AGTGCCGCCG CTGGTTATCT 1320 

GCCGGGTTCC ATCTCSACCC CCGCACCTTA GTTTTTGATG AGTCAGTGCC TTGTAGCTGC 1380 

CGAACCACCA TCCGGCGGAT CGCTGGAAAA TTTTGCTGTT TTATGAAGTG GCTCGGTCAG 1440 

GAGTGTTCTI GTTTCCTCiSA GCCCGCCGAG GGGCTGGCGG GCGACCAAGG TCATGACAAT 1500 

GAGGCCTATG AAGGCTCTGa\ TGTTGATACT GCTGAGCCTG CCACCCTAGA CATTACAGGC 1560 

TCATACATCG TGGATGGTCG GJTTCTGCAA ACTGTCTATC AAGCTCTCGA CCTGCCAGCT 1620 

GACCTGGTAG CTCGCGCAGC CCEACTGTCT GCTACAGTTA CTGTTACTGA AACCTCTGGC 1680 

CGTCTGGATT GCCAAACAAT ^tIgGCAAT AAGACTTTTC TCACTACCTT TGTTGATGGG 1740 

GCACGCCTTG AGGTTAACGG GCCTGAGCAG CTTAACCTCT CTTTTGACAG CCAGCAGTGT 1800 

AGTATGGCAG CCGGCCCGTT TTGCCtWc TATGCTGCCG TAGATGGCGG GCTGGAAGTT 1860 

CATTTTTCCA CCGCTGGCCT CGAGAGpC^T^HGTTTTCC CCCCTGGTAA TGCCCCGACT 1920 

GCCCCGCCGA GTGAGGTCAC CGCGTTCTgV TCAGCTCTTT ATAGGCACAA CCGGCAGAGC 1980 

CAGCGCCAGT CGGTTATTGG\TAGTTTGTGg\cTGCACCOTG AAGGTTTGCT CGGCCTGTTC 2040 

CCGCCCTTTT CACCCGGGCA TGAGTGGCGG TtTGCTAACG CATTJl^CGG CGAGAGCACG 2100 

CTCTACACCC GCACTTGGTC q\CAATTACA GA^AC^€*Cf\T AACTGTCGG GCTAATTTCC 2160 

GGTCA1TTGG ATGCTGCTCC CWCTCgGGT GGG0CACCTG C&ACTGCCAC AGGCCCTGCT 2220 

GTAGGCTCGT CTGACTCTCC AGWCCCTGAC CCGCtWtG ATfiTTACAGA TGGCTCACGC 2280 

CCCTCTGGGG CCCGTCCGGC TGgVVcCAAC CCGAAT&GCG TTOCGCAGCG CCGCTTACTA 2340 

CACACCTACC CTGACGGCGC TAAGATCTAT GTCGGCTCCA TTTTCGAGTC TGAGTGCACC 2400 

TGGCTTGTCA ACGCATCTAA CGCCGGCCAC CGCCCTGGTG GCGGGCTTTG TCATGCTTTT 2460 

TTTCAGCGTT ACCCTGATTC GTTTGACGCC ACCAAGTTTG TGATGCGTGA TGGTCTTGCC 2520 

GCGTATACCC TTACACCCCG GCCGATCATT CATGCGGTGG C&CCGGACTA TCGATTGGAA 2580 

CATAACCCCA AGAGGCTCGA GGCTGCCTAC CGCGAGACTT GCGCCCGCCG AGGCACTGCT 2640 

GCCTATCCAC TCTTAGGCGC TGGCATTTAC CAGGTGCCTG TTAGTOTGAG TTTTGATGCC 2700 

TGGGAGCGGA ACCACCGCCC GTTTGACGAG CTTTACCTAA CAGAGCTIGGC GGCTCGGTGG 2760 

TTTGAATCCA ACCGCCCCGG TCAGCCCACG TTGAACATAA CTGAGGATAC CGCCCGTGCG 2820 

GCCAACCTGG CCCTGGAGCT TGACTCCGGG AGTGAAGTAG GCCGCGCATG\TGCCGGGTGT 2880 
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AAAGTCGAGC CTGGCGTTGT GCGGTATCAG 

TCAAAGTCCG TGCAACAGGC GGATGTGGAT 

AACGCTTGGC GGCGCCGGGG CTTTGCGGCA 

AGCGGCCGTA GGGTTGtW TGATGAGGCC 

catatgcagc gtgctgcatvc tgtgcacctc 
gattttgagc acaccggtcAgattccagca 
tggcatgtca cccaccgttg ccctgcagat 
aaaatccaga ctacaagtaa ggigctccgt 
cagaagctag tgttcacaca ggcigctaag 
gaggcccagg gtgccacttt taccactaca 
ctcatacagt cctcccgggc tcacgctata 
gttatacttg actctcccgg cctgttgcgt 
aatttcttcc tttcgggtgg cgaggtt- 
aaccctgacc gcaatgttga cgtdft/tgcg 



TTCCATCAGC TTGCTGAGGA GCTG( 
CCCTGCCCTG AGCTTGAGCA GGGCC 
AGTGTTGTGA CATTTGAGCT AACT( 
AGGAAAGCTG TTTTGTCCAC GCTGGTA^GC 
GCGGGTCACA CCGATGTCCG CGCCTCCCTT 
ACTGCCACCA CCTGTGAACT CTTTGAGCTT 
GGTTCAGCCG TCCTCGAGTT GGATTTGTGC 
CAGAAGGATT GTAACAAGTT CACGACCGGC 
GGTATCTTCC GCTGGAGTAA GACGTTTTGT 
GAGAAGGCTA TTCTATCCCT TTTACCACAA 
TCAGTATTCT CTGCTGCCGT GGCTGGCGCC 
TCTGAGTTTG ACTCGACTCA GAATAACTTT 
GAGTGTGGTA TGCCCCAGTG GCTTGTCAGG 
CTGCAGGCCC CAAAAGAGTC TTTGAGAGGG 

104 



TTTACAGCCG 
GTTGTTGTTG 
TTCACTCCGC 
CCTTCGCTCC 
CTTGGGGACC 
ATACGGCCGG 
GTCTGTGAGT 
TCCCTTTTCT 
GCCGCGCACC 
ACTATAATTG 
GTTGCTCTCA 
GAGGTGGGTA 
CACCAGAGAC 

GTTTCCA 
CGGCCGGC£ 
CTGCCAC 
GTGCAcWc 
CGGTATGGCA 
GCGCGCTTTi 
GTAGAGGCGA 
AGCCGAGATG 
GAGACAATTG 
GCCCTGTTTG 
GCTGTGTTCT 
AGCCATGCCA 
TCCCTAGGTC 
TTGTACCATG 
TTCTGGAAGA 




2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
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agcVtgctct ggaatacggt gtggaacatg gcaatcattg cccattgcta tgagttccgg 

gacctcxagg ttgccgcctt caagggcgac gactcggtcg tcctctgtag tgaataccgc 

cagagcccag GCGCCGGTTC GCTTATAGCA GGCTGTGGTT TGAAGTTGAA GGCTGACTTC 

cggccgaVtg GGCTGTATGC CGGGGTTGTC GTCGCCCCGG GGCTCGGGGC CCTACCCGAT 

gtcgttcga\ TCGCCGGACG GCTTTCGGAG AAGAACTGGG GGCCTGATCC GGAGCGGGCA 

GAGCAGCTCC WtCGCCGT GCAGGATTTC CTCCGTAGGT TAACGAATGT GGCCCAGATT 

TGTGTTGAGG TgVtGTCTAG AGTTTACGGG GTTTCCCCGG GTCTGGTTCA TAACCTGATA 

GGCATGCTCC AGAcVaTTGG TGATGGTAAG GCGCATTTTA CAGAGTCTGT TAAGCCTATA 

CTTGACCTTA CACACt\aAT TATGCAC'CGG TCTGAATGAA TAACATGTGG TTTGCTGCGC 

CCATGGGTTC GCCACCAtW GCCCTAGGCC TCTTTTGCTG TTGTTCCTCT TGTTTCTGCC 

TATGTTGCCC GCGCCACCGa\cCGGTCAGCC GTCTGGCCGC CGTCGTGGGC GGCGCAGCGG 

CGGTACCGGC GGTGGTTTCT G^GTG^CG^GGTt^ATTCT GAGCCCTTCG CAATCCCCTA 

TATTCATCCA ACCAACCCCT TTGOCCCAGA CGTTGCCGCT GCGTCCGGGT CTGGACCTCG 

CCTTCGCCAA CCAGCCCGGC cWttgWc CACTTGGCGA GATCAGGCCC AGCGCCCCTC 
CGCTGCCTCC CGTCGCCGAC CTGCCACAGQ CGGGGCTGCG\ GCGGTtiACGG CTGTGGCGCC 



TGCCCATGAC ACCTCACCCG TCf 
GTATAATTTG TCTACTTCAC CD 



SGGACGT T6AJ?CtCGC SGTGCAATTC TACGCCGCCA 
TCATC CTCYGTGGCC TGTGGCACTA ATTTAGTCCT 
GTATGCAGCC CCCCTTAATC CGCftTtTGCC GCTGCAGGAC GGTftCTAATA CTCACATTAT 
GGCCACAGAG GCCTCCAATT ATGCA^AGTA CCGGGTTGCC CGC&uTACTA TCCGTTACCG 
GCCCCTAGTG CCTAATGCAG TTGGAGGCTA TGCTATATCC\ATTTCTTTCT GGCCTCAAAC 
AACCACAACC CCTACATCTG TTGACATGAA TTCCATTACT TCCACTGATG TCAGGATTCT 
TGTTCAACCT GGCATAGCAT CTGAATTGGT CATCCCAAGC GAGOGCCTTC ACTACCGCAA 
TCAAGGTTGG CGCTCGGTTG AGACATCTGG TGTTGCTGAG GAGGAAGCCA CCTCCGGTCT 
TGTCATGTTA TGCATACATG GCTCTCCAGT TAACTCCTAT ACCAATACCC CTTATACCGG 
TGCCCTTGGC TTACTGGACT TTGCCTTAGA GCTTGAGTTT CGCAATCTck CCACCTGTAA 
CACCAATACA CGTGTGTCCC GTTACTCCAG CACTGCTCGT CACTCCGCCC GAGGGGCCGA 
CGGGACTGCG GAGCTGACCA CAACTGCAGC CACCAGGTTC ATGAAAGATC TCOACTTTAC 
CGGCCTTAAT GGGGTAGGTG AAGTCGGCCG CGGGATAGCT CTAACATTAC TTAACGJTGC 
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tgacacgctcVtcggcgggc tcccgacaga attaatttcg tcggctggcg ggcaactgtt 

TTATTCCCGC cXgGTTGTCT CAGCCAATGG CGAGCCAACC GTGAAGCTCT ATACATCAGT 
ggagaatgct cag\aggata AGGGTGTTGC TATCCCCCAC GATATCGATC TTGGTGATTC 
GCGTGTGGTC ATTCAGGATT ATGACAACCA GCATGAGCAG GATCGGCCCA CCCCGTCGCC 
TGCGCCATCT CGGCCT AlTT CTGTTCTCCG AGCAAATGAT GTACTTTGGC TGTCCCTCAC 
TGCAGCCGAG TATGACCAGJ CCACTTACGG GTCGTCAACT GGCCCGGTTT ATATCTCGGA 
CAGCGTGACT TTGGTGAATGVTTGCGACTGG CGCGCAGGCC GTAGCCCGAT CGCTTGACTG 
GTCCAAAGTC ACCCTCGACG GfSCGGCCCCT CCCGACTGTT GAGCAATATT CCAAGACATT 
CTTTGTGCTC CCCCTTCGTG GCaVgCTCTC CTTTTGGGAG GCCGGCACAA CAAAAGCAGG 
TTATCCTTAT AATTATAATA CTAckdAG TGACCAGATT CTGATTGAAA ATGCTGCCGG 
CCATCGGGTC GCCATTTCAA CCTATA(\CAC CAGGCTTGGG GCCGGTCCGG TCGCCATTTC 
TGCGGCCGCG GTTTTGGCTC CACGCTCck CCTGGCTCTG CTGGAGGATA CTTTTGATTA 
TCCGGGGCGG GCGCACACAT TTGATGACTt\.CT^CTGAA TGCOGCGCTT TAGGCCTCCA 
GGGTTGTGCT TTCCAGTCAA CTGTCSCTGA^WcCAGCGC CTTAAAGTTA AGGTGGGTAA 

aactcgggag ttgtagttta tttggctotg cccmxtact TATATCTGCT gatttccttt 



ATTTCCTTTT TCTCGGTCCC GCGCTCC 

or a fifth sequence 
CGGGCCCCGT ACAGGTCACA ACCTGTG 



[TG A 

SEQ 10^0. 12) 

iGj^TfiTACGAGCTV AGTGGAGGfcC ATGGTCGAGA 



AAGGCCAGGA TGGCTCCGCC GTCCTTGAGC VcGATCTCTG CAACCGTGAC GTGTCCAGGA 
TCACCTTTTT CCAGAAAGAT TGCAATAAGT TCACCACGGG AGAGACCATt GCCCATGGTA 
AAGTGGGCCA GGGCATTtCG GCCTGGAGTA AGACCTTCTG TGCCcYlTTC GGCCCCTGGT 
TCCGTGCTAT TGAGAAGGCT ATTCTGGCCC TGCTCCCTCA GGGTGTgVtT TATGGGGATG 
CCTTTGATGA CACCGTCTTC TCGGCGCGTG TGGCCGCAGC AAAGGCGTc\ ATGGTGTTTG 
AGAATGACTT TTCTGAGTTT GACTCCACCC AGAATAATTT TTCCCTGGGC VtAGAGTGTG 
CTATTATGGA GAAGTGTGGG ATGCCGAAGT GGCTCATCCG CTTGTACCAC CTJATAAGGT 
CTGCGTGGAT CCTGCAGGCC CCGAAGGAGT CCCTGCGAGG GTGTTGGAAG AAaWtCCG 
GTGAGCCCGG CACTCTTCTA TGGAATACTG TCTGGAACAT GGCCGTTATC ACCCATTGTT 
ACGATTTCCG CGATTTGCAG GTGGCTGCCT TTAAAGGTGA TGATTCGATA GTGCTTT^CA 

GTGAGTACCG TCAGAGTCCA GGGGCTGCTG TCCTGATTGC TGGCTGTGGC TTAAAGCTGA 
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6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7171 

60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



AGGXGGGTTT CCGTCCGATT GGTTTGTATG CAGGTGTTGT GGTGACCCCC GGCCTTGGCG 780 

CGCTt\cCGA CGTCGTGCGC TTGTCCGGCC GGCTTACTGA GAAGAATTGG GGCCCTGGCC 840 

CTGAGCG&GC GGAGCAGCTC CGCCTTGCTG TGCG 874 

or a sequence complementary thereto . 
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14* A ki\t comprising, in a container or separate 
containers, a pair of single-strand primers derived 
from nonhomologous regions of opposite strands of a 
DNA duplex fragmeVt derived from an enterically 
transmitted viral hepatitis agent whose genome 
contains a region wnich is homologous to the 1*33 kb 
DNA EcoRI insert present in plasmid pTZKFl ( ET1 • 1 ) 
carried in coli strain BB4 and having ATCC deposit 
no. 67717. 



15. The kit of clarc 
opposite strands of the 
plasmid. 



r5;x which are derived from 
>RI duplex insert in said 



16 ♦ A method f oi 
enterically transmittec 



detecting tfte^presence of an 
nonA/ncWB ^hepatitis viral 
agent in a biological s amp le^^ompri sing 

preparing a mixture\of duplex DI^A fragments 
derived from the sample, 

denaturing the duplet fragments, 

adding to the denatured DNA fragments, a pair of 
single-strand primers derived from nonhomologous 
regions of opposite strands of a DNA duplex fragment 
derived from an enterically transmitted^ viral 
hepatitis agent whose genome contains a region which 
is homologous to the 1.3 3 kb DNA EcoRI irvsert present 
in plasmid pTZKFl ( ET1 . 1 ) carried in E_^ col\j. strain 
BB4, and having ATCC deposit no. 67 717, 

hybridizing said primers to homo logousV sequence 
region of opposite strands of such duplex DNA\ 
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fragments derived from enterically transmitted 
nonA/ nonB hepatitis agent, 

reaating the primed fragment strands with DNA 
polymerase \in the presence of DNA nucleotides, to form 
5 new DNA duplexes containing the primer sequences, and 

repeating said denaturing, adding, hybridizing 
and reacting s\teps , until a desired degree of 
amplification af sequences is achieved, 

10 17, The method of claim 16, wherein the primers 

are derived from opposite strands of the EcoRI duplex 
insert in said plasmid. 
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18. The method Y>f claim 16, for detecting the 
presence of viral agent in a sample of cultured cells 
infected with the agent\ 

19, A vaccine for inmunizAncL an individual 
against enterically transmitted nonA>nonB hepatitis 
viral agent comprising ,* inr a pharmacologically 
acceptable adjuvant, a recombinant protein derived 



l/smitted nonA/ nonA ^viral 
lome CQntains^a region which 
kb DNA^ficoRI insert present 
in plasmid pTZ-RFl(ETl . 1)\ Varied in E, col^. strain 
BB4 , and having ATCC deposit no, 6,7717 



from an enterically tra 
hepatitis agent whose gd 
is homologous to the 1 . 3 
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20. The vaccine of cla&m 19, therein the protein 
is derived from the EcoRI insert in s^aid plasmid. 

21. A vaccine for immunizing an individual 
against HEV comprising, in a pharmacologically 
acceptable adjuvant, a protein encoded bv genetic 
sequence 406.3-2 or 406.4-2 or a f ragment\t hereof . 

22. In a method of isolating an enterically 
transmitted nonA/nonB viral agent or a nucleUc acid 
fragment produced by the agent, an improvement which 
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comprises: utilizing, as a source of said agent, bile 
obtained from a human or cynomolgus monkey having an 
active infection of enterically transmitted non-A/non- 
B hepatitis . 



23. The method 
obtained from an in 



24. Human pol] 
10 human immunized with\ 
enterically transmil 
agent whose genome cc 
homologous to the 1.3 3 kb 
plasmid pTZKFl ( ET1 . 1 ) carrj 
15 and having ATCC deposit no, 




22, wherein the bile is 
mo 1 gus monkey . 



s0rum obtained from a 
(derived form an 
on-B viral hepatitis 
egion which is 
coRI insert present in 
in E. coli strain BB4 
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